首页> 外文OA文献 >Exploiting clustering algorithms in a multiple-level fashion: A comparative study in the medical care scenario
【2h】

Exploiting clustering algorithms in a multiple-level fashion: A comparative study in the medical care scenario

机译:以多级方式利用聚类算法:医疗场景中的比较研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Clustering real-world data is a challenging task, since many real-data collections are characterized by an inherent sparseness and variable distribution. An appealing domain that generates such data collections is the medical care scenario where collected data include a large cardinality of patient records and a variety of medical treatments usually adopted for a given disease pathology. This paper proposes a two-phase data mining methodology to iteratively analyze dierent dataset portions and locally identify groups of objects with common properties. Discovered cohesive clusters are then analyzed using sequential patterns to characterize temporal relationships among data features. To support an automatic classication of a new data objects within one of the discovered groups, a classication model is created starting from the computed cluster set. A mobile application has been also designed and developed to visualize and update data under analysis as well as categorizing new unlabeled records. A comparative study has been conducted on real datasets in the medical care scenario using diverse clustering algorithms. Results were compared in terms of cluster quality, execution time, classication performance and discovered sequential patterns. The experimental evaluation showed the eectiveness of MLC to discover interesting knowledge items and to easily exploit them through a mobile application. Results have been also discussed from a medical perspective.
机译:集群现实世界数据是一项具有挑战性的任务,因为许多现实数据集合的特征是固有的稀疏性和可变分布。产生此类数据收集的一个吸引人的领域是医疗保健场景,其中收集的数据包括大量患者记录的基数和通常用于给定疾病病理的各种医疗方法。本文提出了一种两阶段的数据挖掘方法,以迭代分析不同的数据集部分并在本地识别具有共同属性的对象组。然后使用顺序模式分析发现的内聚簇,以表征数据特征之间的时间关系。为了支持在发现的组之一中对新数据对象进行自动分类,从计算的群集集开始创建分类模型。还设计和开发了移动应用程序,以可视化和更新正在分析的数据以及对新的未标记记录进行分类。已经使用各种聚类算法对医疗场景中的真实数据集进行了比较研究。比较了结果的集群质量,执行时间,分类性能和发现的顺序模式。实验评估表明,MLC能够有效地发现有趣的知识项目,并通过移动应用程序轻松利用它们。还从医学角度讨论了结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号